AITopics | figure 4

Collaborating Authors

figure 4

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

How Width and Data Shape Generalization Scaling Laws in Quadratic Neural Networks

Girardin, Julius, Troiani, Emanuele, Xu, Yizhou, Erba, Vittorio, Krzakala, Florent, Zdeborová, Lenka

arXiv.org Machine LearningJun-29-2026

Understanding how performance scales jointly with model size and data is a central problem in modern machine learning. Existing theoretical works on scaling laws typically describe generalization as a function of data or compute, often in fixed-feature or infinite-width regimes and for online SGD. Here, we instead study how generalization scales with the number of trainable parameters and the number of samples in a feature-learning model. We analyze $\ell_2$-regularized empirical test error minimization in a quadratic two-layer network in a finite-sample setting with structured data. This setting allows for an explicit characterization of the generalization error as a function of the number of samples, model width, and regularization. Our results reveal a phase diagram with distinct scaling regimes as the number of parameters varies. In particular, the generalization error follows data-dependent power laws controlled by the spectral structure of the target. We further characterize the transitions between regimes, including the onset of interpolation, and their impact on generalization.

artificial intelligence, defilippisetal, machine learning, (17 more...)

arXiv.org Machine Learning

2606.28242

Country: North America > United States (0.14)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.64)

Add feedback

Disentangling Misreporting from Genuine Adaptation in Strategic Settings: ACausal Approach

Neural Information Processing SystemsJun-21-2026, 09:46:18 GMT

In settings where ML models are used to inform the allocation of resources, agents affected by the allocation decisions might have an incentive to strategically change their features to secure better outcomes. While prior work has studied strategic responses broadly, disentangling misreporting from genuine adaptation remains a fundamental challenge. In this paper, we propose a causally-motivated approach to identify and quantify how much an agent misreports on average by distinguishing deceptive changes in their features from genuine adaptation. Our key insight is that, unlike genuine adaptation, misreported features do not causally affect downstream variables (i.e., causal descendants). We exploit this asymmetry by comparing the causal effect of misreported features on their causal descendants as derived from manipulated datasets against those from unmanipulated datasets. We formally prove identifiability of the misreporting rate and characterize the variance of our estimator. We empirically validate our theoretical results using a semi-synthetic and real Medicare dataset with misreported data, demonstrating that our approach can be employed to identify misreporting in real-world scenarios.

causal effect, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States (1.00)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Providers & Services > Reimbursement (1.00)
Health & Medicine > Government Relations & Public Policy (1.00)
(2 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)

Add feedback

From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning

Neural Information Processing SystemsJun-19-2026, 18:31:11 GMT

Finetuning provides a scalable and cost-effective means of customizing language models for specific tasks or response styles, with greater reliability than prompting or in-context learning. In contrast, the conventional wisdom is that injecting knowledge via finetuning results in brittle performance and poor generalization. We argue that the dichotomy of "task customization" (e.g., instruction tuning) and "knowledge injection" (e.g., teaching new facts) is a distinction without a difference. We instead identify concrete factors that explain the heterogeneous effectiveness observed with finetuning. To this end, we conduct a large-scale experimental study of finetuning the frontier Gemini v1.5 model family on a spectrum of datasets that are artificially engineered to interpolate between the strengths and failure modes of finetuning. Our findings indicate that question-answer training data formats provide much stronger knowledge generalization than document/articlestyle training data, numerical information can be harder for finetuning to retain than categorical information, and models struggle to apply finetuned knowledge during multi-step reasoning even when trained on similar examples--all factors that render "knowledge injection" to be especially difficult, even after controlling for considerations like data augmentation and information volume. On the other hand, our findings also indicate that it is not fundamentally more difficult to finetune information about a real-world event than information about writing style.

information, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Leisure & Entertainment > Sports (0.93)
Law (0.93)
Media (0.68)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)

Add feedback

Clustering based on Stochastic Dominance with application for risk averters and risk seekers

Li, Hua, Jia, Xue, Kang, Yilin, Wong, Wing-Keung

arXiv.org Machine LearningMay-26-2026

Stock clustering algorithms play a pivotal role in quantitative finance and the asset management industry, serving as a core mechanism for understanding market complexity and conducting asset preselection. Their intrinsic value lies in enabling investors to identify the true underlying structure of the stock market, thereby categorizing stocks with similar return characteristics or risk profiles into distinct groups. This data-driven market segmentation not only significantly reduces the computational dimensionality involved in portfolio construction but also provides a solid foundation for formulating differentiated investment strategies. A review of existing literature reveals that scholars both domestic and international have achieved fruitful results in stock clustering. Traditional clustering research predominantly employs classic machine learning algorithms: Xiaojun (2019) and Wu et al. (2022) utilized the K-means algorithm for stock partitioning; Huang et al. (2010) and Lu et al. (2020) explored the sectoral structures of the SSE 50 Index and other markets based on Agglomerative Hierarchical Clustering (AHC) and Spectral Clustering; Korzeniewski (2018) further introduced the Partitioning Around Medoids (PAM) algorithm to construct portfolios with enhanced risk resistance. In recent years, with the advancement of deep learning, L ucio and Caiado (2022) and Siregar and Yosia (2024) have attempted to incorporate time-series models (such as TGARCH) or specific market features (e.g., Indonesian stock data) into clustering frameworks. However, despite their respective merits in capturing market trends, these methods share a common limitation: traditional stock clustering approaches predominantly rely exclusively on stock-specific information (e.g., price, volatility, or financial metrics), neglecting the heterogeneity of market participants--namely, the "investors". In reality, investors are typically categorized into three distinct types based on their risk preferences: risk-averse, risk-seeking, and risk-neutral. Divergent risk attitudes inevitably lead to fundamentally different asset selection logic.

algorithm, artificial intelligence, machine learning, (15 more...)

arXiv.org Machine Learning

2605.24422

Country:

Asia > China (1.00)
North America > United States > New York (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Banking & Finance > Trading (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback

Coupling-Informed Transport Maps for Bayesian Filtering in Nonlinear Dynamical Systems

Zeng, Dengfei, Jiang, Lijian, Sun, Shuyu, Xiao, Dunhui

arXiv.org Machine LearningMay-14-2026

A likelihood-free transport filtering method is proposed based on the couplings between state and observation variables. By exploiting a block-triangular structure in the transport map, the analysis step of filtering is reformulated as the minimization of the maximum mean discrepancy (MMD) between the true joint measure and its transport-based approximation. To circumvent the non-convexity in the MMD optimization, we introduce a training-free transport filter method via gradient flows, which leads to an analytic computation for the transport map that implies the steepest descent direction of the MMD. The proposed approach accurately approximates non-Gaussian filtering posteriors and avoids particle collapse. We provide a convergence analysis for the expectation of the MMD between the approximated posterior and the truth posterior. Finally, we extend the method to high-dimensional problems through domain localization. Numerical examples demonstrate the superior performance of our approach over conventional filtering methods in nonlinear, non-Gaussian scenarios.

artificial intelligence, machine learning, transport map, (15 more...)

arXiv.org Machine Learning

2605.13174

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)

Add feedback

Threshold Learning for Optimal Decision Making

Nathan F. Lepora

Neural Information Processing SystemsMay-1-2026, 05:45:59 GMT

Decision making under uncertainty is commonly modelled as a process of competitive stochastic evidence accumulation to threshold (the drift-diffusion model). However, it is unknown how animals learn these decision thresholds. We examine threshold learning by constructing a reward function that averages over many trials to Wald's cost function that defines decision optimality. These rewards are highly stochastic and hence challenging to optimize, which we address in two ways: first, a simple two-factor reward-modulated learning rule derived from Williams' REINFORCE method for neural networks; and second, Bayesian optimization of the reward function with a Gaussian process. Bayesian optimization converges in fewer trials than REINFORCE but is slower computationally with greater variance. The REINFORCE method is also a better model of acquisition behaviour in animals and a similar learning rule has been proposed for modelling basal ganglia function.

artificial intelligence, machine learning, threshold, (17 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Industry:

Education (0.85)
Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count

Neural Information Processing SystemsApr-29-2026, 23:07:01 GMT

We present the HOH (Human-Object-Human) Handover Dataset, a large object count dataset with 136 objects, to accelerate data-driven research on handover studies, human-robot handover implementation, and artificial intelligence (AI) on handover parameter estimation from 2D and 3D data of two-person interactions. HOH contains multi-view RGB and depth data, skeletons, fused point clouds, grasp type and handedness labels, object, giver hand, and receiver hand 2D and 3D segmentations, giver and receiver comfort ratings, and paired object metadata and aligned 3D models for 2,720 handover interactions spanning 136 objects and 20 giver-receiver pairs--40 with role-reversal--organized from 40 participants. We also show experimental results of neural networks trained using HOH to perform grasp, orientation, and trajectory prediction. As the only fully markerless handover capture dataset, HOH represents natural human-human handover interactions, overcoming challenges with markered datasets that require specific suiting for body tracking, and lack high-resolution hand tracking. To date, HOH is the largest handover dataset in terms of object count, participant count, pairs with role reversal accounted for, and total interactions captured.

artificial intelligence, interaction, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.68)
Europe (0.46)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)
Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.35)

Add feedback

Worst-case Performance of Popular Approximate Nearest Neighbor Search Implementations: Guarantees and Limitations

Neural Information Processing SystemsApr-29-2026, 20:34:58 GMT

Graph-based approaches to nearest neighbor search are popular and powerful tools for handling large datasets in practice, but they have limited theoretical guarantees. We study the worst-case performance of recent graph-based approximate nearest neighbor search algorithms, such as HNSW, NSG and DiskANN. For DiskANN, we show that its "slow preprocessing" version provably supports approximate nearest neighbor search query with constant approximation ratio and poly-logarithmic query time, on data sets with bounded "intrinsic" dimension. For the other data structure variants studied, including DiskANN with "fast preprocessing", HNSW and NSG, we present a family of instances on which the empirical query time required to achieve a "reasonable" accuracy is linear in instance size. For example, for DiskANN, we show that the query procedure can take at least 0.1n steps on instances of size nbefore it encounters any of the 5nearest neighbors of the query.

artificial intelligence, information retrieval, natural language, (18 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)

Add feedback

When Expressivity Meets Trainability: Fewer than n Neurons Can Work

Neural Information Processing SystemsApr-25-2026, 19:26:22 GMT

Modern neural networks are often quite wide, causing large memory and computation costs. It is thus of great interest to train a narrower network. However, training narrow neural nets remains a challenging task. We ask two theoretical questions: Can narrow networks have as strong expressivity as wide ones? If so, does the loss function exhibit a benign optimization landscape?

artificial intelligence, machine learning, neural network, (14 more...)

Neural Information Processing Systems

Country:

North America > United States (0.46)
Asia > China (0.29)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Filters

Collaborating Authors

figure 4

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

How Width and Data Shape Generalization Scaling Laws in Quadratic Neural Networks

Disentangling Misreporting from Genuine Adaptation in Strategic Settings: ACausal Approach

From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning

Clustering based on Stochastic Dominance with application for risk averters and risk seekers

Coupling-Informed Transport Maps for Bayesian Filtering in Nonlinear Dynamical Systems

Threshold Learning for Optimal Decision Making

HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count

Worst-case Performance of Popular Approximate Nearest Neighbor Search Implementations: Guarantees and Limitations

cfa45151ccad6bf11ea146ed563f2119-Supplemental.pdf

When Expressivity Meets Trainability: Fewer than n Neurons Can Work